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To what extent do general features of folding/unfolding kinetics of small globular proteins follow 
from their thermodynamic properties? To address this question, we investigate a new simplifed 
protein chain model that embodies a cooperative interplay between local conformational preferences 
and hydrophobic burial. The present four-helix-bundle 55mer model exhibits proteinlike calorimetric 
two-state cooperativity. It rationalizes native-state hydrogen exchange observations. Our analysis 
indicates that a coherent, self-consistent physical account of both the thermodynamic and kinetic 
properties of the model leads naturally to the concept of a native state ensemble that encompasses 
considerable confomational fluctuations. Such a multiple-conformation native state is seen to in- 
volve conformational states similar to those revealed by native-state hydrogen exchange. Many of 
these conformational states are predicted to lie below native baselines commonly used in interpreting 
calorimetric data. Folding and unfolding kinetics are studied under a range of intrachain interaction 
strengths as in experimental chevron plots. Kinetically determined transition midpoints match well 
with their thermodynamic counterparts. Kinetic relaxations are found to be essentially single ex- 
ponential over an extended range of model interaction strengths. This includes the entire unfolding 
regime and a significant part of a folding regime with a chevron rollover, as has been observed for 
real proteins that fold with non-two-state kinetics. The transition state picture of protein fold- 
ing and unfolding is evaluated by comparing thermodynamic free energy profiles with actual kinetic 
rates. These analyses suggest that some chevron rollovers may arise from an internal frictional effect 
that increasingly impedes chain motions with more native conditions, rather than being caused by 
discrete deadtime folding intermediates or shifts of the transition state peak as previously posited. 
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I. INTRODUCTION 

Our physical knowledge of protein folding is mea- 
sured by the extent to which current ideas of elemental 
polypeptide interactions are capable of reproducing ex- 
perimental data. Tremendous experimental progress has 
been made in the recent past.^~^ During the same time, 
investigations of simplified self-contained polymer models 
have contributed much physical insig hiJ-^^ These mod- 
els are successful in physically rationalizing many general 
features of proteins in terms of polymer properties, build- 
ing a foundation for future advances. To move forward in 
theoretical development, it is necessary to recognize what 
common protein properties have not been predicted by 
models to date and target our efforts towards rectifying 
such deficiencies. A prime example is the current lack of 
chain models that can quantitatively reflect the extreme 
kinetic and thermodynamic cooperativity of small single- 
domain proteins. ^'^^ This highlights our insufficient un- 
derstanding of protein energetics even at a "big-picture" 
level, suggesting that as heteropolymers natural proteins 
may be quite special in some respects. 



We have recently investigated the severe constraints 
imposed on protein polymer models by the experimental 
observations of calorimetric and other hallmarks of ther- 
modynamic two-state cooperativity. "'^^"■^'^ These cooper- 
ativity requirements appear to be more stringent than 
most other generic protein properties studied so far. A 
variety of flexible heteropolymer models with additive 
residue-based contact energies are able to explain signifi- 
cant aspects of the folding process. ''"^^ But such additive 
models — at least for the several examples evaluated to 
date — are found to be insufficient to satisfy the thermo- 
dynamic cooperativity requirements, even though devi- 
ations from proteinlike thermodynamics can be lessened 
in some instances by enhancing interaction heterogeneity 
through using larger numbers of letters and repulsive en- 
ergies in model alphabets. As far as thermodynamic co- 
operativity is concerned, in scenarios tested thus far, we 
find that proteinlike thermodynamics can arise from non- 
additive cooperative contributions that originate from an 
interplay between local conformational preferences and 
(mostly nonlocal) interactions that favor formation of 
protein cores. 



In the folding literature, an intimate correspondence 
between protein thermodynamics and kinetics has figured 
prominently in thcorctical/'^^^^'' modeling, ^"^^^^ and 
interpretative^'"^ discourses. Therefore, we ask: Given 
that a heteropolymer model has already been constrained 
to satisfy a set of protcinlikc thermodynamic properties, 
to what extent proteinlike kinetic behavior would follow 
automatically? For instance, would such a model be suf- 
ficient for two-state kinetics as observed for many small 
single-domain proteins^? More generally, what improve- 
ment in proteinlike kinetics would such a model enjoy 
over other models that are now known to be thermody- 
namically less cooperative? 

In analyses of folding/unfolding kinetics experiments, 
free energy profiles arc used extensively to provide useful 
insight^''^'^^"^'^ and as a picturesque device to summarize 
data. However, other than the folding free energy and 
rate measurements themselves, independent experimen- 
tal techniques to accurately define and determine such 
profiles are currently lacking. Moreover, many of these 
profiles were proposed without considering explicit chain 
representations. Therefore, the applicability and gener- 
ality of their implied physical pictures remain to be as- 
certained. Coarse-grained protein chain models are well 
suited to shed light on this fundamental issue because 
they allow for broad conformational sampling. Free en- 
ergy profiles in coarse-grained models can be obtained 
directly from chain population distributions, without re- 
gard to (and therefore independent of) kinetic rates. It 
follows that a rigorous evaluation of the applicability of 
transition state theory to protein folding can be con- 
ducted by comparing the transition-state-predicted rates 
and the actual kinetic rates in these models. We study 
one such model below. 

II. A MODEL FOR THERMODYNAMIC 
COOPERATIVITY 

The present analysis is based on a thermodynamically 
cooperative 55mer lattice protein model that folds to a 
groimd state with a four- helix core (Fig. 1). The intra- 
chain interaction scheme includes additive 5-letter con- 
tact energies,^'^'^^ repulsive interactions disfavoring left- 
handed helices and sharp turns at the end of a helix, 
as well as cooperative "native hydrogen bond burial" 
terms^° (c.f. refs. 35, 36). The total energy E is de- 
fined by Eq. (1) in ref. 20. Although "native-centric" 
interactions were introduced to enhance thermodynamic 
cooperativity in the present model, unlike the usual Go 
construction, they arc not necessary for recognizing the 
ground state. This is because the general, non-native- 
centric terms in the model (all terms in E except the 
"native hydrogen bond burial" terms) are sufficient to 
provide global favorability to the proteinlike four-helix 
ground state. We note that several other studies^^"^^ 



have also emphasized cooperative interactions in protein 
folding; and nonadditive aspects of hydrophobic effects 
are being explored. As we have emphasized,^" al- 
though the present model is useful for exploring the is- 
sues at hand, it should be regarded as tentative, partly 
because it does not provide an explicit account of other 
possible physical origins of protein thermodynamic coop- 
erativity such as sidechain packing. Furthermore, 
in view of the current lack of definitive understanding 
of hydrogen bonding energetics (see discussion and refer- 
ences in ref. 20), the cooperative "native hydrogen bond 
burial" energy in the present model should be broadly in- 
terpreted as representative of a general favorable coupling 
between local conformational preference and formation of 
proximate tertiary contacts, the physical mechanisms of 
which remain to be further elucidated. 




FIG. 1. Thermodynamic stabilities and definitions of na- 
tive and denatured states. One of the eight iso-energetic 
ground-state conformations is shown in the inset, where 

black beads denote nominally hydrophobic residues.^" Free 
energies of unfolding AG = kBT\n{Pj<j/Pr>). Solid curves 
(labeled by Et) classify conformations with E < Et and 
E > Et as native and denatured, respectively. Dashed curves 
show the free energy of denatured conformations, defined as 
those with E greater than the values shown, relative to the 
ground-state-only native state with Et = —52.04. All results 
presented in this paper were obtained using model energetic 
parameters 7ih = 6.0, 7st ~ 5.0, fneiix = 0, bSub = —0.8, and 
b = 1.5 as specified ref. 20. 

In addition to the formulation in ref. 20, here we intro- 
duce a parameter e to model protein behaviors at differ- 
ent intrachain interaction strengths, such that the effec- 
tive energy of a conformation with energy E is equal to 
—eE, hence its Boltzmann weight equals exp{eE/kBT), 
where ksT is Boltzmann's constant times absolute tem- 



peraturc. It follows that the partition function Q = 
^^g{E)exp{€E/kBT), where g{E) is the number of 
conformations with energy and g{E) is estimated by 
a parameter-space Monte Carlo histogram technique.^" 
The formulation in ref. 20 corresponds to e = —1. Be- 
cause of the peculiar and significant temperature de- 
pendence of the solvent-mediated interactions in real 
proteins, varying —e/ksT serves better as a model for 
how effective intrachain interactions arc modulated at 
constant temperature by denaturant concentration'** or 
denaturant activity^^ than for how Boltzmann weights 
changes with temperature. -'^'^'''^"''''^ Here, as a first ap- 
proximation, denaturant effects are simply taken to be 
uniform over different interaction types. Models with 
different denaturant effects on different interaction types 
remain to be explored. 
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FIG. 2. Free energies profiles are given by negative log- 
arithmic distributions of energy (solid curves), plotted here 
for the e/fesT's shown. P{E) is the sum of Boltzmann 
weights of conformations with energies E' in the range 
E - 0.5 < E' < E + 0.5. The vertical dashed line marks 
the E = —34 free energy peak for e/kaT = —1.56. The inset 
shows the peak region of this profile, where lines joining a pair 
of open circles [— In P(_E) vahies] record all single-chain-move 
interconvorsions between a conformation with E < —34 and 
one with E > —33 in our simulation. These kinetic connec- 
tions suggest identifying the shaded area (—38 < E < —30) 
as a transition state region. 

As we have discussed from a general polymer 
perspective,"'^^ matching theoretical considerations with 
the experimental practice of calorimetric baseline sub- 
tractions necessitate a multiple-conformation native state 
that entails considerable fluctuations beyond small- 
amplitude vibrations. Here we further investigate 
the implications of native-state conformational diver- 



sity. To that end, we study different definition of 
"native" and "denatured" states by assigning differ- 
ent values for a "transition" energy Et demarcating 
the native and denatured ensembles, such that Pn 
= '^^^^^ g{E) eyi^{eE / ksT) / Q is the fractional native 
population and Pd = 1 ~ Pn is the fractional denatured 
population (Fig. 1). For each of these definitions to be 
tested, a range of energies is spanned by the native state, 
except for the special case when is chosen to be equal 
to the ground-state energy. Remarkably, despite the dif- 
ferences in the definitions of "native" and "denatured" 
states, the thermodynamic (AG ~ 0) transition mid- 
points of the different EtS in Fig. 1 are very similar, 
all at e/ksT » -1.56. 

Stabilities of different denatured ensembles relative to 
that of the ground state are shown in Fig. 1 (dashed 
curves). These quantities correspond roughly to native 
state hydrogen exchange (HX) free energies, (sec 
also ref. 55), for it is reasonable to expect that certain 
amides become exposed and exchangeable when the ef- 
fective energy of a conformation is above a certain thresh- 
old. Our results share the same general trend as that ob- 
served in these experiments, suggesting that some 
of the fluctuations observed by HX may be considered as 
part of a multiple-conformation native state. **'*^'^^ Fig- 
ure 1 indicates that linear extrapolation of AG from the 
transition region to the strongly native regime (more neg- 
ative e/ksT) is only valid for the free energy difference 
between the set of "fully unfolded" open conformations 
and the ground state (top dashed curve). 

III. FREE ENERGY PROFILES AND CHEVRON 
PLOTS 

Consistent with the model's thermodynamically two- 
state character,^" its energy distribution is bimodal under 
denaturing conditions and moderately native conditions 
(Fig. 2), although the distribution becomes one-sided or 
"downhill" '"'^'''^'^ under strongly native conditions (e.g., 
when e/ksT = —2.0). We emphasize that here the de- 
termination of InP(P) is entirely independent of any ki- 
netic consideration. Therefore, the present \nP{E) func- 
tion reflects the actual thermodynamics of the model. 
As such, its physical origin is fundamentally different 
from free energy profiles that have been empirically con- 
structed or postulated to fit rate data. We can therefore 
use these lnP(i?)'s to assess the transition state picture, 
with E as the reaction coordinate. Different reaction co- 
ordinates have been used in other investigations. 

We employ standard Metropolis Monte Carlo 
dynamics^^'^^ to explore physically plausible kinetic 
scenarios, using the number of attempted moves as 
the model time. This approach has been proven 
usefuF"^-''^^'^^ despite its obvious hmitations.^^'-'^'^^ 
The present set of elementary chain moves consists of end 



flips, corner flips, crankshafts, rigid rotations, and 
local moves that transform two turns of a right-handed 
helix among its three possible orientations while holding 
its two end monomers fixed. The relative frequencies of 
attempting these moves are 2.3%, 27%, 60.6%, 10%, and 
0.1%, respectively. Some chain moves can lead to large 
changes in energy, hence movements along the model free 
energy profile need not be continuous (inset of Fig. 2). 
Therefore, it is more justified to regard the transition 
state as a region rather than a single highest point along 
this particular reaction coordinate. ^^'^^'^'^ The group of 
conformations represented by the shaded area in Fig. 2 
clearly serves the role of a transition state because all 
conformational interconvertions between the native and 
unfolded sides of the population distribution must pass 
through one or more conformations in the shaded area. 
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FIG. 3. Model chevron plots. Average logarithmic rates 
are given by negative logarithms of mean first passage time 
(MFPT). Each folding trajectory starts from a randomly 
generated conformation; unfolding trajectories are initiated 
from the ground state. Each data point is averaged from 
~ 50-1, 000 trajectories. Solid curves through data points are 
mere guides for the eye. Larger squares on the right show un- 
folding MFPT's for attaining energies E > —34 (open) and 
E > -A (filled). Unfolding MFPT's for E > -10 arc essen- 
tially identical to that for E > —4. Other data points (on 
the left) show folding MFPT's for reaching {from top to bot- 
tom) E < -34, -40, -42, -44, -46, and the ground state. 
The vertical dashed lino is the approximate transition mid- 
point. The inclined dashod-dottod lino shows folding rates if 
kinetics were two-state for the present model, an hypothetical 
situation in which the ground state thermodynamic stability 
relative to the fully unfolded conformations (c.f dashed line 
labeled by "—10" in Fig. 1) is given by the difference between 
logarithmic folding rates (dashed-dotted line) and unfolding 
rates extrapolated from the solid squares. 



Figure 3 reports simulated mean first passage 
times^®'^'^ for a range of intraprotein interaction strength 
on both sides of the transition midpoint, in a format iden- 
tical to typical experimental chevron plots. 
We explore a variety of kinetic folding and unfolding cri- 
teria by monitoring the time it takes for the chain to 
first cross several different "finish lines." This results in 
an appreciable variation in apparent rates under strongly 
native conditions (Fig. 3, more negative e/fc^T). Simi- 
lar effects may be operative when multiple experimental 
probes are used to monitor kinetics. 

The trajectory in Fig. 4 (upper panel) depicts the 
model's heuristically "two-state" behavior at the tran- 
sition midpoint. For the two chain properties shown, na- 
tive and denatured parts of the trajectory can be easily 
discerned, with very little time spent in between; strongly 
suggesting that the kinetics is first order. Fluctuations 
in E is considerable within the native (low E) part of 
the trajectory, underscoring the utility and necessity of 
a multiple-conformation native state (see Fig. 1 and be- 
low). Another facet of the native-denatured intercon- 
version is provided by the Rg trace. Consistent with a 
recent kinetic Rg measurement on a small protein, it 
shows that the chain undergoes sharp kinetic transitions 
between a native state that has minimal fiuctuations in 
Rg and a denatured state that spans a wide range of i?g's. 

A more quantitative test introduced by Gutin et al.^'' 
is performed in the lower panel of Fig. 4. It indicates 
that folding kinetics is essentially first order (i.e., single- 
exponential) for an extended range of model intrapro- 
tein interaction strength, covering moderately native con- 
ditions (e/fcsT —1.80) through conditions that are 
less favorable to folding (less negative e/fc^T), although 
deviations from single-exponential behavior occur un- 
der strongly native conditions in the model (e/fcsT < 
— 1.85). Using the same technique, imfolding kinetics 
(Fig. 3) is found to be essentially single-exponential for 
the entire range of unfolding e/fc^T investigated (detailed 
results not shown). We have confirmed these conclusions 
by analyzing first passage time (FPT) distributions as 
in ref. 68 at several e/fcsT's, paying special attention to 
folding kinetics under moderately native conditions that 
are not far from the onset of drastic chevron rollover and 
non-single-exponential behavior (Fig. 3). For example, 
we have obtained the logarithmic FPT distribution at 
e/ksT = —1.72 by binning 1,080 simulated trajectories 
into time slots of 10^, and found that 98% of these tra- 
jectories can be fitted by a single exponential with a cor- 
relation coefficient r = 0.95. If one assumes that the unit 
model time needed for each elementary chain move cor- 
responds roughly to a real time scale of 10""'^'^ 10~^ sec 
(ref. 69), the fastest model folding rate in Fig. 3 is in the 
order of 10^-10^ sec'^ 

The contrast between the present model and its cor- 
responding Go model is intriguing. We have shown that 
the Go model in Fig. 4 is thermodynamically significantly 



less cooper ative,^° yet it folds faster than our model. 
This scenario of a negative correlation between folding 
speed and thermodynamic coopcrativity may bear on the 
issue of folding rate overestimation in folding theories 
that use Go-like potentials. It also raises a more basic 
question as to whether and when the Go prescription is 
sufficiently adequate for capturing minimal frustration^^ 
mechanisms in real proteins. 



chemistry at zero denaturant. It follows that the experi- 
mental zero-denaturant situation for most proteins most 
likely corresponds to e/ksT > —1.80 in Fig. 3. It would 
be interesting to explore whether special experimental 
situations corresponding to the very strongly folding con- 
ditions in the model can be found for some proteins such 
that similar drastic chevron rollovers can be observed. 
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FIG. 4. Upper panel: A typical trajectory at 
e/ksT = —1.56. Rg is radius of gyration in units of lat- 
tice bond length. Lower panel: Folding MFPT's (filled cir- 
cles) through the ground state for the model in Fig. 3 (lower 
curves) are compared to that for a Go model (upper curves) 
that has the same transition midpoint (vertical dashed line), 
uses the same move set, and assigns a —1.5 energy for every 
contact in the conformation in Fig. 1 and zero energy other- 
wise. Lines through data points are mere guides for the eye. 
Open squares are median first passage times divided by In 2, 
which equals MFPT for single-exponential kinetics. Hence a 
discrepancy between the circles and squares signals a devia- 
tion from single-exponential kinetics.®^ 

The present model is proteinlike in that it pre- 
dicts a mild chevron rollover concomitant with 
single-exponential folding kinetics, consistent with 
experiments. ^^''^^'^^ But the drastic chevron rollover 
(with an appreciable decreasing folding rate with increas- 
ing native conditions, and non-singlc-cxponcntial folding 
kinetics) predicted for strongly native conditions in the 
present model (e/fcsT < —1.85 in Fig. 3) has not been 
documented for real proteins. This suggests that such 
conditions, which coincide with downhill folding^'^^'^* in 
the present model (see above), may not be realizable. If 
so, this is not surprising. Native stability can be arbitrary 
high in the model (— e/fcsT can be arbitrarily large), but 
for real proteins native stability is limited by the actual 



IV. ASSESSING THE TRANSITION STATE 
PICTURE 

Consistent with the thermodynamics of our model, ki- 
netic rate of folding to the ground state and of unfolding 
to an open state meet at the approximate transition mid- 
point determined thermodynamically in Fig. 1 (c.f. the 
lower "V" in Fig. 3). Interestingly, a similar consistency 
is seen by matching rates of crossing the peak of the free 
energy profile in the inset of Fig. 2 in the folding and un- 
folding directions (upper "V" in Fig. 3). Near the tran- 
sition midpoint, folding rates defined by crossing several 
different finish lines at low but non-ground-state energies 
are very close to one another, and are only slightly faster 
than the rate of folding to the ground state (Fig. 3). This 
implies that, under midpoint to moderately native condi- 
tions, kinetics is rapid once the folding chain has cleared 
the shaded transition state region in Fig. 2 and proceeds 
on to the native side. But the folding rates for different 
finish lines are very different under strongly native con- 
ditions {e/ksT w —2.0), indicative of glassy dynamics 
(Figs. 3 and 4). 

Despite the essentially single-exponential and heuris- 
tically "two-state" kinetics discussed above, the fold- 
ing/unfolding kinetics of the present model differs from 
the strictly two-state variety observed for an increasing 
number of small single-domain protcins*'^'^'*'^''''™ such 
as a 64-rcsidue form of chymotrypsin inhibitor 2. Fig- 
ure 3 shows that folding rates under moderately to 
strongly native conditions arc slower than that required 
for such strictly thermodynamically and kinetically two- 
state proteins (inclined dashed-dotted line). In fact. 
Fig. 3 is reminiscent of experimental chevron plots with 
rollovers. ■^^'■^^'''^ Examples include wildtype barnase^^ 
and ribonuclease A.^^ Hence we believe the present lat- 
tice construct may serve as a tool for better understand- 
ing the folding kinetics of these proteins. 

How much kinetic information can be inferred from free 
energy profiles such as those in Fig. 2? The conventional 
transition state picture of protein folding^'^^'^° stipulates 
that 



rate = F exp 



kBTj ' 



(1) 



where AG''' is the activation free energy for the kinetic 
process in question. We call AG^/ksT the transition- 
state exponent. F is the pre-exponential front factor^^ 



or prefactor,-^'^ which depends on solvent viscosity (not 
considered here) but is often taken to be insensitive to 
intraprotein interaction strength.^" Figure 5 examines 
whether this picture applies to the present model. It does 
so by investigating the dependence of F on e/ksT. For 
the sake of generality, several physically motivated P{E)- 
based definitions of "transition state," "folded state" and 
"unfolded state" are evaluated. In the tests conducted 
here, "transition state" is defined by cither the shaded 
area in Fig. 2 (-38 < E < -30) or £; = -34 (peak of 
barrier); "folded state" is defined by the ground state 
only {E = —52) or by _E < — 34 (left of the barrier); and 
"unfolded state" is defined by the approximate position 
of the denatured free energy minimum {E = —4), by the 
bulk of the open conformations (i? > — 10), or by > 
—34 (right of the barrier). Transition-state exponents for 
folding and unfolding are then computed, respectively, by 
the [transition/unfolded] and [transition/folded] popula- 
tion ratios at the given e/fcsT (Fig. 5). 

The scaling of F with respect to e/ksT is found to be 
not sensitive to these variations in definition (Fig. 5, up- 
per panel). Our results show that the simple transition 
state picture does not apply to folding in this model. For 
the quasi-linear part of the chevron plot in Fig. 3, the 
relationship between AG^/ksT and - In(MFPT) is ap- 
proximately linear (Fig. 5), with slope w —1.5 (filled sym- 
bols) or « —1.9 (open symbols). This implies that F has 
approximately the same functional form as the activation 
factor in this regime, but with an exponent of opposite 
sign, VIZ., F - cxp{- fjAG^/kBT), where fi « -0.33 for 
folding to the ground state (filled symbols). On the other 
hand, unfolding appears to be well described by the sim- 
ple transition state picture. The corresponding slope for 
unfolding Ri —1.0, hence F fa constant. Folding and 
unfolding front factors are approximately equal near the 
thermodynamic midpoint (Fig. 5, lower panel), reflecting 
the fact that in that region folding and unfolding rates 
are essentially equal (Fig. 3). The role of front factors 
in understanding folding rates has recently been empha- 
sized by Portman et al.^^ The pattern in Fig. 5 is similar 
in some respects to the results of a recent model study 
by Nymeyer ct al.^^ Using a different reaction coordi- 
nate for 2-, 3-letter and Go 27mer models, these authors 
found approximate linear relations with non-unity slopes 
between kinetically simulated rates and rate quantities 
deduced from free energy profiles, although they did not 
consider a broad range of overall interaction strengths as 
that in chevron plots. 

Figure 5 shows that F for folding decreases with in- 
creasingly native conditions. F may be identified as an ef- 
fective diffusion coefficient. It corresponds to an internal 
friction term arising from the impediment to motion im- 
posed by the chain segments on one another. ^■'^'^^'^'^'^^"^^ 
Plaxco and Baker^^ have experimentally investigated in- 
ternal friction in protein folding, and concluded insight- 
fully that internal friction effects are limited for strictly 



two-state proteins. But the functional form they con- 
sidered is different from the novel approximate exponen- 
tial form F ^ exp(— /iAG-f/fegT) discovered here for the 
quasi-linear part of chevron plots and the part with a 
mild rollover (corresponding to a limited range of e/ksT) 
for proteins that are not kinetically two-state. In the 
present model, when a wider range of e/ksT is considered 
(Fig. 5, lower panel), the folding InF reveals its nonlin- 
ear character, a feature anticipated by energy landscape 
theory^^'^^ and consistent with a pioneering simulation 
study of Socci et al.^° 



10 

8 



i 4 

.S o 

OA 

■r 

■g 6 

a 4 

2 




Unfolding 



O 

-20 



-19 -18 



-17 -16 -15 
-ln(MFPT) 



-14 -13 



-2 
-1.8 
-1.6 
-1.4 
-1.2 



-25 



-20 -15 
In (front factor) 



-10 



FIG. 5. Correlations between rates and transition-state ex- 
ponents. Filled symbols are for reaching the ground state 

in folding and reaching E > —4 in unfolding, open symbols 
are for crossing the — In P{E) peak at i5 = —34 in cither 
directions. Upper panel: Folding transition-state expo- 
nent AG^/ksT = -In[P(-38 < E < -30) /P{E > -10)] 
(circles), -ln[P(-38 < E < -30)/P{E > -34)] (tri- 
angles), -ln[P(£; = -34)/P(£; = -4)] (squares), or 
-ln[P(£; = -M)/P{E > -10)] (diamonds). Data 
points shown arc for e/ksT > —1.75. For unfolding, 
AG^kBT = -ln[P(-38 < E < -30)/P{E = -52)] 
(squares), -ln[P(-38 < E < -30)/P{E < -34)] (cir- 
cles), or - ln[P{E = -34)/P(£ = -52)] (diamonds). The 
straiglit lines arc fitted. Lower paneh InF = — In(MFPT) 
-I- AG^/fcsr (horizontal variable) vs. e/ksT. Filled and open 
circles (for folding) and squares (for unfolding) identify the 
folding and unfolding AG*//cbT used, as defined in the up- 
per panel. The dashed line marks the approximate transition 
midpoint. 



V. THE CALORIMETRIC CONNECTION: WHAT 
IS THE NATIVE STATE? 

A thermodynamic consideration of the model's free en- 
ergy profiles (Fig. 2) and the above kinetic analysis 



suggest that a natural way to define the "native" and 
"denatured" states is to have their demarcation hne at 
Ef = —34. Figure 6 investigates the calorimctric impli- 
cations of different Et choices. As a first test of prin- 
ciples, in this section we take the effective intraprotein 
energies as temperature independent. Our conclusions 
are not expected to be significantly altered by the incor- 
poration of proteinlike temperature-dependent effective 
interactions.^^ 
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FIG. 6. Heat capacity vs. temperature (from ref. 20). Here 
nonlinear denatured (high T) dashed baselines"'^^'^"' arc com- 
puted for {from top to bottom) Et = -52.04, -48, -42, -38, 
and —34 (c.f Fig. 1). Corresponding native baselines are 
plotted but are too close to one another to be distinguishable. 
The vertical dotted line marks the approximate transition 
midpoint used throughout this work. Also shown are exam- 
ple compact non-ground-state conformations with E = —36.0 
(top) and E = —46.0 (bottom). The beads mark monomers 
that are not in their folded environment, i.e., do not have 
their full set of contacts as that in the ground state. 

We have argued that empirical calorimctric baseline 
subtractions correspond essentially to an operational def- 
inition of the native and denatured ensembles. The de- 
marcation energy (or enthalpy) between the native and 
denatured states may be ascertained by matching em- 
pirical baselines^® to the nonlinear "formal two-state" 
baselines of Zhou et al.^^ Figure 6 shows that the base- 
lines from empirically extrapolating the native and de- 
natured tails of the heat capacity curve of the present 
model^° essentially coincide with the formal two-state 
baselines for Et = —34, implying that by adopting such 
empirical baselines Et = —34 is effectively adopted. In 
that case, the two chain conformations shown in Fig. 6 
would belong to the native state and therefore may be 
regarded as sitting "below the calorimetric baseline." 



These chains may model the sparsely populated confor- 
mations revealed by native-state HX.^^~^^ A multiple- 
conformation native state view is supported by a re- 
cent experimental observation of pretransitional confor- 
mational changes in ribonuclease A.''^ 

VI. DISCUSSION 

The logic of the present analysis is premised on a com- 
parison between simulated folding/unfolding rates and 
transition-state predictions based on independently ob- 
tained free energy profiles. The conventional transition 
state picture of folding posits a weak or nonexistent de- 
pendence of the front factor on a protein's intrachain in- 
teraction strength. In the present model, which exhibits 
a chevron rollover, we find that the conventional picture 
holds approximately for unfolding but not for folding. In 
particular, for the quasi-linear part of the folding arm 
of the chevron plot, the folding front factor adopts a re- 
scalcd form of the exponential factor, harboring an ex- 
ponent opposite in sign to that of the activation term. 
These findings are consistent with internal friction and 
diffusive folding dynamics ideas from energy landscape 
theory. They suggest that simple transition state theory 
with a constant front factor may not be generally appli- 
cable in the presence of a chevron rollover, even if the 
kinetics is apparently first order. In this regard, future 
single-molecule measurements^'' of folding time distribu- 
tions may provide important insight into the physics un- 
derlying approximate single-exponential folding kinetics, 
since these measurements may detect small deviations 
from first-order kinctics"'^'^ (e.g., a possible small non- 
exponential tail in the distribution) that would otherwise 
be difficult to ascertain from traditional measurements of 
ensemble-averaged folding rates. 

Chevron rollovers have been rationalized by dead- 
time discrete intermediates^^ and by movements of the 
transition-state peak on broad activation barriers."'^ We 
have not been able to detect these proposed features in 
our model free energy profiles. Instead, the present re- 
sults offer an alternate rationalization in terms of diffu- 
sive dynamics and an interaction-dependent folding front 
factor. It follows that, in general, analyses that focus 
exclusively on free energy profiles may be incomplete. 
Inasmuch as chevron rollovers are a manifestation of an 
interaction-dependent front factor, as suggested here, ex- 
perimental observations of significant mutational effects 
on rollover behavior^^ imply that mutations can have a 
significant effect not only on the free energy profile itself, 
but also on front factors not afforded by such profiles. 

The present account of salient features of chevron 
rollover and native-state HX in terms of an essentially 
continuous energy distribution (Fig. 3) is similar in spirit 
to the recent idea that these features may originate 
from a "burst phase continuum." ^^'^^ However, the burst 



phase continuum view is based on postulated free energy 
profiles, not free energy profiles derived from models with 
explicit chain representations. Further effort will be re- 
quired to elucidate the relationship between the burst 
phase continuum and the present chain-based perspec- 
tives, as there arc apparent differences between the two. 
For instance, the present study suggests that some of the 
states detected by native-state HX are on the native side 
of the conformational distribution (Figs. 3 and 6) rather 
than on the denatured side as envisioned by the burst 
phase continuum scenario. 

In summary, we emphasize that while the current 
study proposes a new physical rationalization for chevron 
rollover, it does not by itself rule out other mechanisms. 
Chevron rollovers in real proteins may arise from a combi- 
nation of effects. Obviously, the generality of the present 
interaction-dependent front factor scenario should be fur- 
ther tested using model proteins with non-helical native 
topologies as well as using geometrically more realistic 
off-lattice continuum models. ^^'^^ 

As for the relationship between generic features of 
folding/ unfolding kinetics and thermodynamics of small 
globular proteins, the qualitative agreement between 
Fig. 3 and typical chevron rollover plots for real proteins 
supports the idea that proteinlike thermodynamics nec- 
essarily lead to proteinlike folding/unfolding kinetics. A 
case in point is the folding kinetics of a set of 20-letter 
model sequences reported by Gutin et al.^'' Our test cal- 
culations show that random sequences of this particular 
20-lcttcr alphabet with additive contact energies are not 
calorimetrically two-state (data not shown). Although 
much useful insight has been gained from them^°'^^'^^ 
(see also rcf. 16), recent calculations'^^ indicate that 
even some designed sequences in this 20-letter model 
are thermodynamically less cooperative than the present 
model. Apparently, the folding kinetics of these 20- 
letter model sequences are less proteinlike as a result, 
in that their folding rates decrease when native stability 
is increased from the transition midpoint (Figs. 2, 5, 7 
and 8 in ref. 67). This is because the maximum folding 
rates and the onset of drastic rollover in these 20-letter 
models occur around the thermodynamic transition mid- 
point, rather than under strongly native conditions as in 
the cooperative model studied here (Fig. 3 of the present 
work). Thus, the chevron trend predicted by these 20- 
letter models under transition midpoint through mod- 
erately folding conditions is opposite to that observed 
experimentally, '*^''''' because experiments almost invari- 
ably show an increasing folding rate when native stability 
is increased from the transition midpoint. 

The present model's kinetics is proteinlike but not two- 
state. In this respect, it is reassuring that the exercise 
here fares no worse than Nature's. This is because a 
protein's calorimetric two-state cooperativity,^^ such as 
that of hen lysozyme,^^'^^ barnase,^^ and ribonuclease 
A,^^ is no guarantee for two-state kinetics.^'^ However, 



the present exercise does suggest that additional or al- 
ternate interaction mechanisms have to be discovered to 
account for the strictly two-state behavior of many small 
single-domain proteins. In that regard, it would be inter- 
esting to investigate the connection between the strictly 
two-state proteins' apparently nonglassy kinetics^" and 
the possibility that their front factors might be mini- 
mally sensitive to the variation in intrachain interaction 
strength. 
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